We present a framework that couples the syntax and semantics of naturallanguage sentences in a generative model, in order to develop a semantic parserthat jointly infers the syntactic, morphological, and semantic representationsof a given sentence under the guidance of background knowledge. To generate asentence in our framework, a semantic statement is first sampled from a prior,such as from a set of beliefs in a knowledge base. Given this semanticstatement, a grammar probabilistically generates the output sentence. A jointsemantic-syntactic parser is derived that returns the $k$-best semantic andsyntactic parses for a given sentence. The semantic prior is flexible, and canbe used to incorporate background knowledge during parsing, in ways unlikeprevious semantic parsing approaches. For example, semantic statementscorresponding to beliefs in a knowledge base can be given higher priorprobability, type-correct statements can be given somewhat lower probability,and beliefs outside the knowledge base can be given lower probability. Theconstruction of our grammar invokes a novel application of hierarchicalDirichlet processes (HDPs), which in turn, requires a novel and efficientinference approach. We present experimental results showing, for a simplegrammar, that our parser outperforms a state-of-the-art CCG semantic parser andscales to knowledge bases with millions of beliefs.
展开▼
机译:我们提出了一个在生成模型中耦合自然语言句子的语法和语义的框架,以便开发一种语义分析器,以在背景知识的指导下共同推断给定句子的句法,形态和语义表示。为了在我们的框架中产生共识,首先要从先验中抽取语义陈述,例如从知识库中的一组信念中抽取语义陈述。给定这种语义陈述,语法就可能生成输出句子。派生一个联合语义语法分析器,该语法分析器返回给定句子的$ k $-最佳语义和语法分析。语义先验是灵活的,并且可以以不同于先前语义解析方法的方式用于在解析过程中并入背景知识。例如,与知识库中的信念相对应的语义陈述可以被赋予较高的先验概率,可以为类型正确的语句提供较低的概率,并且可以为知识库以外的信念赋予较低的概率。我们语法的构造调用了层次化Dirichlet流程(HDP)的新颖应用,而这又需要一种新颖而有效的推理方法。我们提供的实验结果表明,对于一个简单的语法,我们的解析器的性能优于最先进的CCG语义解析器,并且可以扩展到具有数百万个信念的知识库。
展开▼